Lexical Ontology Extraction Using Terminology Analysis
Abstract
Most of the work described in this paper was conducted as part of the Recovering Evidence from Video by fusing Video Evidence Thesaurus and Video MetaData (REVEAL) project, sponsored by the UK’s Engineering and Physical Sciences Research Council (EPSRC). REVEAL aims to reduce the time-consuming yet essential tasks undertaken by UK police officers when dealing with terascale collections of video related to crime scenes. The project is working towards technologies that archive video annotated automatically on the basis of prior annotations of similar content, enabling rapid access to CCTV archives and providing capabilities for automatic video summarisation. This involves semantic annotation relating to, amongst other things, content and temporal reasoning. In this paper, we describe the ontology extraction components of the system in development and their use in REVEAL for automatically populating a CCTV ontology from analysis of expert transcripts of the video footage.
Similar resources
Semi-Automatic Ontology Development: Processes and Resources (Pazienza, University of Roma Tor Vergata, Italy; Armando Stellato, University of Roma Tor Vergata, Italy)
The collection of the specialized vocabulary of a particular domain (terminology) is an important initial step of creating formalized domain knowledge representations (ontologies). Terminology Extraction (TE) aims at automating this process by collecting the relevant domain vocabulary from existing lexical resources or collections of domain texts. In this chapter, the authors address the extrac...
Combination of endogenous clues for profiling inferred semantic relations: experiments with Gene Ontology
Acquisition and enrichment of lexical resources is acknowledged as an important research topic in computational linguistics. While such resources are often missing, specialized domains such as biomedicine offer several structured terminologies. In this paper, we propose a high-quality method for exploiting a structured terminology and inferring an elementary synonym lexicon. The method is bas...
Semantic Interpretation of Terminological Strings
Terminology is the surface appearance of relevant domain concepts. Though many methods have been presented for extracting relevant domain terminology from texts, the semantic interpretation of these terms is still left to ontology engineers. In this paper we present a method for term extraction and semantic interpretation, based on the use of corpora and existing lexical databases, such as WordNet.
A protocol for constructing a domain-specific ontology for use in biomedical information extraction using lexical-chaining analysis
In order to do more semantics-based information extraction, we require specialized domain models. We develop a hybrid approach for constructing such a domain-specific ontology, which integrates key concepts from the protein-protein interaction domain with the Gene Ontology. In addition, we present a method for using the domain-specific ontology in a discourse-based analysis module for analyzin...
Mining Multiword Terms from Wikipedia
The collection of the specialized vocabulary of a particular domain (terminology) is an important initial step of creating formalized domain knowledge representations (ontologies). Terminology Extraction (TE) aims at automating this process by collecting the relevant domain vocabulary from existing lexical resources or collections of domain texts. In this chapter, the authors address the extrac...
Harvesting Ontologies from Open Domain Corpora: a Dynamic Approach
In this work we present a robust approach for dynamically harvesting domain knowledge from open domain corpora and lexical resources. It relies on the notion of Semantic Domains and provides a fully unsupervised method for terminology extraction and ontology learning. It makes use of an algorithm based on Conceptual Density to extract useful relations from WordNet. The method is efficient, accu...